SEQUENCE LISTING 



(1) GENERAL INFORMATION: 



(i) 




(ii) 

(iii) 
(iv) 



APPLICANT : 

(A) NAME: Max-Planck-Gesellschaf t zur Fo^rderung der 

Wissenschaf ten e.V. 

(B) STREET: none 

(C) CITY: Berlin 

(E) COUNTRY: DE 

( F) POSTAL CODE (ZIP) : none 

(A) NAME: CNRS 

(B) STREET: rue de la cardonille 

(C) CITY: Montpellier Cedex 05 

(E) COUNTRY: FR 

(F) POSTAL CODE (ZIP) : 34094 

TITLE OF INVENTION: Nucleic acid( molecules coding for mammalian 
tumor suppressor proteins and methods for their isolation 



NUMBER OF SEQUENCES: 17 



/ 



COMPUTER READABLE FORM: / 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 (EPO) 



(2) INFORMATION FOR SEQ ID NO: \i 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: "2790 base' pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : si/ngle 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA/to mRNA 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 54 2/ .2545 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

GAATTCGGGA GAGCAAGCGG G^VTCTCCTG GGCGCCGTCA TGGCTGCTTA GGCTGCGCTG 60 

CCTGCGGATC GCGGATCCGG GATCGGAGAT CTGACGGCGA CGCCTGAGTC CGGCTAGGGT 120 

AGGTCTGGGT TGGAGTCTGT GCCTGCTTCC TTGGCGTGTG GTTGTTCCTG CTTGATTGCT 180 

TCAGCGTGCC ATCGGCTTCG / TATTTGCATA GGAGTCAGAG GAGTTAATCT TGTCTCCTCG 240 

AAGATAGACT CTCATGGTTTI ATGATCCATC TCTGTGAGAA GACTTTATTT GTCTGTCTCT 300 
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TCTCACAGGT TTGAGTCTTC AGACTTCTAC AGAACTCCAT AATATCTGCC TCACAGCTGG 

CTTTCCTGCT CTCACAGAAG ATACCCAGCT ATTGTGCTCT GGATCTCTCC TGGCTGCTAG 

GCTGTAGCGC TGCCTTTCTG GAGTCAGGCT GTAGTGACTC CCCACCTTCT TTCTGTCTGG 

GCTTAAATGG CACAGCAGTT CCTCAGCACA TCTGAAGAAG AAAGTGTGAG AACCAAAGGC 

C ATG GCT CCA TTC CGC TGT CAA AAA TGT GGC AAG TCC TTC GTC ACC 
Met Ala Pro Phe Arg Cys Gin Lys Cys Gly Lys Ser Phe Val Thr 
15 10 15 

CTG GAG AAG TTC ACC ATT CAC AAT TAT TCC CAC TCC AGG GAG CGC CCA 
Leu Glu Lys Phe Thr lie His Asn Tyr Ser His Ser Arg Glu Arg Pro 
20 25 30 

TTC AAG TGC TCG AAG GCT GAG TGT GGC AAA GCC TTC GTC TCC AAG TAT 
Phe Lys Cys Ser Lys Ala Glu Cys Gly Lys Ala Phe Val Ser Lys Tyr 
35 40 45 

! t 

AAG CTG ATG AGA CAC ATG GCC ACA CAC TCG CCA CAG AAG ATT CAC CAG 
Lys Leu Met Arg His Met Ala Thr His Ser Pro Gin Lys lie His Gin 
50 55 60 

TGT ACT CAC TGT GAG AAG ACA TTC AAC CGG AAG GAC CAC CTG AAG AAC 
Cys Thr His Cys Glu Lys Thr Phe Asn Arg Lys Asp His Leu Lys Asn 
65 70 75 

CAC CTC CAG ACC CAC GAT CCC AAC AAG ATC TCC TAC GCG TGT GAC GAT 
His Leu Gin Thr His Asp Pro Asn Lys lie Ser Tyr Ala Cys Asp Asp 
80 85 90 95 

TGC GGC AAG AAG TAC CAC ACC ATG CTG GGC TAC AAG AGG CAC CTG GCC 
Cys Gly Lys Lys Tyr His Thr Met Leu Gly Tyr Lys Arg His Leu Ala 
100 ** 105 110 

CTG CAC TCG GCG AGC AAT GGC GAT CTC ACC TGT GGG GTG TGC ACC CTG 
Leu His Ser Ala Ser Asn Gly Asp Leu Thr Cys Gly Val Cys Thr Leu 
115 120 125 

GAG CTG GGG AGC ACC GAG GTC CTG CTG GAC CAC CTC AAG TCT CAC GCG 
Glu Leu Gly Ser Thr Glu Val Leu Leu Asp His Leu Lys Ser His Ala 
130 135 140 

GAA GAA AAG GCC AAC CAG GCA CCC AGG GAG AAG AAA TAC CAG TGC GAC 
Glu Glu Lys Ala Asn Gin Ala Pro Arg Glu Lys Lys Tyr Gin Cys Asp 
145 150 155 

CAC TGT GAT AGA TGC TTC TAC ACC CGG AAA GAT GTG CGT CGC CAC CTG 
His Cys Asp Arg Cys Phe Tyr Thr Arg Lys Asp Val Arg Arg--His Leu 
160 165 170 175 

GTG GTC CAC ACA GGA TGC AAG GAC TTC CTG TGT CAG TTC TGT GCC CAG 
Val Val His Thr Gly Cys Lys Asp Phe Leu Cys Gin Phe Cys Ala Gin 
180 185 190 



AGA TTT GGG CGC AAA GAC CAC CTC ACT CGT CAC ACC AAG AAG ACC CAC 
Arg Phe Gly Arg Lys Asp His Leu Thr Arg His Thr Lys Lys Thr His 
195 200 205 
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TCC CAG GAG CTG ATG CAA GAG AAT ATG CAG GCA GGA GAT TAC CAG AGC 1210 
Ser Gin Glu Leu Met Gin Glu Asn Met Gin Ala Gly Asp Tyr Gin Ser 
210 215 220 

AAT TTC CAA CTC ATT GCG CCT TCA ACT TCG TTC CAG ATA AAG GTT GAT 12 53 

Asn Phe Gin Leu lie Ala Pro Ser Thr Ser Phe Gin lie Lys Val Asp 
225 230 235 

CCC ATG CCT CCT TTC CAG CTA GGA GCG GCT CCC GAG AAC GGG CTT GAT 130 6 

Pro Met Pro Pro Phe Gin Leu Gly Ala Ala Pro Glu Asn Gly Leu Asp 
240 245 250 255 

GGT GGC TTG CCA CCC GAG GTT CAT GGT CTA GTG CTT GCT GCC CCA GAA 135 4 

Gly Gly Leu Pro Pro Glu Val His Gly Leu Val Leu Ala Ala Pro Glu 
260 265 270 

GAA GCT CCC CAA CCC ATG CCG CCC TTG GAG CCT TTG GAG CCT TTG GAG 14 02 

Glu Ala Pro Gin Pro Met Pro Pro Leu Glu Pro Leu Glu Pro Leu Glu 
275 230 , 285; 

CCT TTG GAG CCT TTG GAG CCG ATG CAG TCT TTG GAG CCT TTG CAG CCT 14 50 

Pro Leu Glu Pro Leu Glu Pro Met Gin Ser Leu Glu Pro Leu Gin Pro 
290 295 300 

TTG GAG CCG ATG CAG CCT TTG GAG CCA ATG CAG CCT TTG GAG CCG ATG 14 93 

Leu Glu Pro Met Gin Pro Leu Glu Pro Met Gin Pro Leu Glu Pro Met 
305 310 315 

CAG CCT TTA GAG CCT TTG GAG CCT CTG GAG CCG ATG CAG CCT TTG GAG 154 6 

Gin Pro Leu Glu Pro Leu Glu Pro Leu Glu Pro Met Gin Pro Leu Glu 
320 325 330 335 

CCG ATG CAG CCT TTG GAG CCT ATG CAG CCA ATG CTG CCA ATG CAG CCA 15 94 

Pro Met Gin Pro Leu Glu Pro Met Gin Pro Met Leu Pro Met Gin Pro 
340 345 350 

ATG CAG CCA ATG CAG CCA ATG CAG CCA ATG CTG CCA ATG CAG CCA ATG 164 2 

Met Gin Pro Met Gin Pro Met Gin Pro Met Leu Pro Met Gin Pro Met 
355' 360 365 

CTG CCA ATG CAG CCA ATG CAG CCA ATG CAG CCA ATG CTG CCA ATG CCA ' 1690 

Leu Pro Met Gin Pro Met Gin Pro Met Gin Pro Met Leu Pro Met Pro 
370 375 380 

GAG CCG TCT TTC ACT CTG CAC CCT GGC GTA GTT CCC ACC TCT CCT CCC 17 33 

Glu Pro Ser Phe Thr Leu His Pro Gly Val Val Pro Thr Ser Pro Pro 
335 390 395 

CCA ATT ATT CTT CAG GAG CAT AAG TAT AAT CCT GTT CCT ACC -TCA TAT 17 8 6 

Pro lie lie Leu Gin Glu His Lys Tyr Asn Pro Val Pro Thr Ser Tyr 
400 405 410 415 

GCC CCA TTT GTA GGC ATG CCC GTC AAA GCA GAT GGC AAG GCC TTT TGC 18 34 

Ala Pro Phe Val Gly Met Pro Val Lys Ala Asp Gly Lys Ala Phe Cys 
420 • 425 430 

AAC GTG GGT TTC TTT GAG GAA TTT CCT CTG CAA GAG CCT CAG GCG CCT 183 2 

Asn Val Gly Phe Phe Glu Glu Phe Pro Leu Gin Glu Pro Gin Ala Pro 
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435 



440 445 
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465 470 
PPT GCC TCT CTG GAG ATT TCC TCC CTA TTG GGG TTT TGG CAG CTC CCC 
So Sa Ser 2S Gl« He Ser Ser Leu Leu Gly Phe Trp Gin Leu Pro 
480 485 

PPT CCT ACT CCC CAG AAT GGC TTT GTG AAT AGC ACC ATC CCT GTG GGG 
Pro Pro Sr Pro Gin Asn Gly Phe Val Asn Ser Thr lie Pro Val Gly 



500 
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515 



rrA CCG CCA CTG CCG CCG CCA CCA CCG CTG CCA CTG CCA CAG CCA CTG 
Pro Pro Pro Leu Pro Pro Pro Pro Pro Leu Pro Leu Pro Gin Pro Leu 



530 535 



CCA GTG CCA CAG CCA CTA CCA CAG CCA CAG ATG CAG CCA CAG TTT CAG 
Pro Sal S Gin Pro Leu Pro Gin Pro Gin Met Gin Pro Gin Phe Gin 

TTG CAG ATC CAG CCC CAG ATG CAG CTA CCA CAG CTG CTG CCG CAA CTG 
HI Sn ne Gin Pro Gin Met Gin Leu Pro Gin Leu Leu Pro Gin Leu 

560 565 
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595 600 
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610 615 
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" 625 630 63d 
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640 645 650 

CAA ATA GCA GGA CTC GTC TAT AAG AAA TGG ACA GTT TAG TTCCTCTTCT 

Gin He Ala Gly Leu Val Tyr Lys Lys Trp Thr Val 

660 665 

TGTTAGCTTA CTCTGTAGTT TCTTCTTCTT GTTGCCCATT GTGTAGCTTT ATAGAGTGTG 



1930 



1978 



2026 



2074 



2122 



2170 



2218 



2266 



2314 



2362 



2410 



2458 



2506 



2555 



2615 
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ACGCTATTGA TGTCTCCATT TTTTAAAGTG AATTTAAATG TACTGTTCAA TATTTTTCAT 2 67 5 

GTGATGTTGT TCCAATGTGA GTTACGACTT CATTTATCTT AAAG AC AAAA CTGGTTGTCA 2735 
GTCATATCTG ACAGAAGAAA GAAATCACTG TGTAACCAAG CCATATAGCG GCCGC 2 7 90 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 668 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala Pro Phe Arg Cys Gin Lys Cys Gly Lys Ser Phe Val Thr Leu 
! 5 10 ' \ 15 

Glu Lys Phe Thr He His Asn Tyr Ser His Ser Arg Glu Arg Pro Phe 
20 25 30 

Lys Cys Ser Lys Ala Glu Cys Gly Lys Ala Phe Val Ser Lys Tyr Lys 
35 40 45 

Leu Met Arg His Met Ala Thr His Ser Pro Gin Lys He His Gin Cys 
50 55 60 

Thr His Cys Glu Lys Thr Phe Asn Arg Lys Asp His Leu Lys Asn His 
65 70 75 80 

Leu Gin Thr His Asp Pro Asn Lys He Ser Tyr Ala Cys Asp Asp Cys 
85 " 90 95 

Glv Lys Lys Tyr His Thr Met Leu Gly Tyr Lys Arg His Leu Aia Leu 
100 105 HO 

His Ser Ala Ser Asn Gly Asp Leu Thr Cys Gly Val Cys Thr Leu Glu 
115 120 125 

Leu Gly Ser Thr Glu Val Leu Leu Asp His Leu Lys Ser His Ala Glu 
130 135 140 

Glu Lys Ala Asn Gin Ala Pro Arg Glu Lys Lys Tyr Gin Cys Asp His 
145 150 155 160 

Cys Asp Arg Cys Phe Tyr Thr Arg Lys Asp Val Arg Arg His Leu Val 
165 170 175 

Val His Thr Gly Cys Lys Asp Phe Leu Cys Gin Phe Cys Ala Gin Arg 
180 185 190 

Phe Gly Arg Lys Asp His Leu Thr Arg His Thr Lys Lys Thr His Ser 
195 ' 200 205 



Gin Glu Leu Met Gin Glu Asn Met Gin Ala Gly Asp Tyr Gin Ser Asn 
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210 



215 



220 



Phe Gin Leu lie Ala Pro Ser Thr Ser ?he Gin He Lys Val Asp Pro 
225 230 235 240 



Met Pro Pro Phe Gin Leu Gly Ala Ala Pro Glu Asa Gly Leu Asp Gly 

250 25d 



245 



Gly Leu Pro Pro Glu Val His Gly Le 



u Val Leu Ala Ala Pro Glu Glu 



260 



265 



270 



Ala Pro Gin Pro Met Pro Pro Leu Glu Pro Leu Glu Pro Leu Glu Pro 
275 280 285 

Leu Glu Pro Leu Glu Pro Met Gin Ser Leu Glu Pro Leu Gin Pro Leu 
290 295 300 

Glu Pro Met Gin Pro Leu Glu Pro Met Gin Pro Leu Glu Pro Met Gin 
305 310 315 320 

' \ 

Pro Leu Glu Pro Leu Glu Pro Leu Glu Pro Met Gin Pro Leu Glu Pro 
325 ' 330 335 

Met Gin Pro Leu Glu Pro Met Gin Pro Met Leu Pro Met Gin Pro Met 
340 345 350 

Gin Pro Met Gin Pro Met Gin Pro Met Leu Pro Met Gin Pro Met Leu 
355 360 365 

Pro Met Gin Pro Met Gin Pro Met Gin Pro Met Leu Pro Met Pro Glu 
370 375 380 

Pro Ser Phe Thr Leu His Pro Gly Val Val Pro Thr Ser Pro Pro Pro 
385 390 395 400 



lie lie Leu Gin Glu His Lys Tyr Asn Pro Val Pro Thr Ser Tyr Ala 
405 410 415 

Pro Phe Val Gly Met Pro Val Lys Ala Asp Gly Lys Ala Phe Cys Asn 
420 425 430 

Val Gly Phe Phe Glu Glu Phe Pro Leu Gin Glu Pro Gin Ala Pro Leu 
435 440 445 

Lys Phe Asn Pro Cys Phe Glu Met Pro Met Glu Gly Phe Gly Lys Val 
450 455 460 



Thr Leu Ser Lys Glu Leu Leu Val Asp Ala Val Asn lie Ala lie Pro 
465 470 475 480 



Ala Ser Leu Glu He Ser Ser Leu Leu Gly Phe Trp Gin Leu Pro Pro 
485 490 495 

Pro Thr Pro Gin Asn Gly Phe Val Asn Ser Thr He Pro Val Gly Pro 
500 505 510 

Gly Glu Pro Leu Pro His Arg He Thr Cys Leu Ala Gin Gin Gin Pro 
515 520 525 
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Pro Pro Leu Pro Pro Pro Pro Pro Leu Pro Leu Pro Gin Pro Leu Pro 
530 535 540 

Val Pro Gin Pro Leu Pro Gin Pro Gin Met Gin Pro Gin Phe Gin Leu 
545 550 555 560 

Gin lie Gin Pro Gin Met Gin Leu Pro Gin Leu Leu Pro Gin Leu Gin 
565 570 575 

Pro Gin Gin Gin Pro Asp Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu 
580 ' 585 590 

Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu Pro Glu 
595 600 605 

Pro Glu Pro Glu Pro Glu Glu Glu Gin Glu Glu Ala Glu Glu Glu Ala 
610 615 620 

Glu Glu Gly Ala Glu Glu Gly Ala Glu Pro Glu Ala Gin Ala Glu Glu 
625 630 635 640 

Glu Glu Glu Glu Glu Glu Ala Glu Glu Pro Gin Pro Glu Glu Ala Gin 
645 650 655 

lie Ala Gly Leu Val Tyr Lys Lys Trp Thr Val * 
660 665 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 36 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 
ATGTCTCGAG GCCTTTGCGG CCGCTATANN NNNNNN 
(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 




(iii) HYPOTHETICAL : YES 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
AGGCCATGG 

(2) I N FORMAT I ON FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 



(iii) HYPOTHETICAL: YES 



(ix) FEATURE: 

(A) NAME /KEY : - '■ \ 

(B) LOCATION: 3 

(D) OTHER INFORMATION: /hote= "N at position 3 is A or G . " 



V (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

tj CCNCCATGG 

^ (2) INFORMATION FOR SEQ ID NO: 6: 

3 (i) SEQUENCE CHARACTERISTICS: 
W (A) LENGTH: 9 amino acids 

M* (B) TYPE: amino acid 

fn (C) STRANDEDNESS: single 

'■fj (D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

His Ser Arg Glu Arg Pro Phe Lys Cys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : linear 



(iii) HYPOTHETICAL: YES 
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p 



(ix) FEATURE: 

(A) NAME/KEY : Cross-links 

(B) LOCATION: 2 

(D) OTHER INFORMATION : / note- "X at position 2 is S or T 

(ix) FEATURE: 

(A) NAME/ KEY : Cross-links 

(B) LOCATION: 7 

(D) OTHER INFORMATION : / note= "X at position 7 is F or Y 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

His Xaa Gly Glu Lys Pro Xaa Xaa Cys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
O (B) TYPE: amino acid 

.p (C) STRANDEDNESS : single 

Ifl J (D) TOPOLOGY: linear 



(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

His Ser Pro Gin Lys 
1 5 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(iii) HYPOTHETICAL: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Lys Lys Trp Thr 
1 



(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 35 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE : other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotid" 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 



GTGATGGCGG CCGCCATTCC GCTGTCAAAA ATGTG 



35 



(2) INFORMATION FOR SEQ ID NO: 11: 

{i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: -11: \ 

CCGCGCCTCG AGGGTCTTCT TGGTGTGACG ' 30 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc ^ "oligonucleotide" 

(xi) SEQUENCE DESCRIPTION : SEQ ID NO: 12: 

GCGGCCGCAG AGCCGTCTTT CACTC 2 5 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: other nucleic acid 

(A) DESCRIPTION: /desc = "oligonucleotide" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

CCGCGCCTCG AGAACTGTCC ATTTCTTATA GAC 33 



(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 18 base pairs 
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(3) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
AT AG C AG T G A GTGCTGTG 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GTTTCTTTTC AGGGACTC 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2334 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(iii) HYPOTHETICAL- NO 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 803. .2192 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
CGGTTCTTTC AATTCAGAAT TTGTTTTAGG TTCTGTTATT GCATAGATTT GCATACCTGT 60 

TTTATGGTAT TTTAATACTG TTGGTTTTAA AAAATACCAT TTCCTCTGAG TGCTGTTCTG 120 

AATATATTAT GTAAGCAATT TTGTGTGTTC TTTTTTTTCC ACTTGCATAA AGCAGGGGAA 180 

AAGTTGAGAG TTTTTCTTAA TCCAGTCCCA AGTAGGACAA AGGATATGAG TGTTTAAAGA 24 0 

TCATCTATTA AAATGCATGA AAAAAC AC T A GAAAATCTCC TGTGCACATC GCCAGTCGTG 300 

TGTGTGCTCT AGAAGTGAA3 TTCAGGGGGT AACATAATGG AGGAATGTTT TCCTAGCTTC 3 60 

ATTCCCTGAC GATGTACAAG GTCTCTTCTC ACAGGTTTGA ATCTTCAGAC AAACTTCTGG 4 20 
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GAGGACTGGG AGGACTCGGT CCCTGCCTCG CAGCAGATGT TCCCTGTCAC TCAGTAGCCA 4 SO 

ATCCGGGGGA CCCAGGACAT GCCCCAGCTA TAGTGATGCA GATTACCTTT CTGGTCCTGA 54 0 

ATCGCACCTG TGCCTCGAGA CTTTCTCCCC TCAGCTTGAG ACTGCATGTA AACTGGGATG 600 

TGTGAAAGCA GGAAGCAAAG CTAGTGACAG CTGAGAGGTC CATGTCTGGG TAGAACCAGG 660 

CCCACGATGC TGCCTCTCCC GTGGTCTGGA GTTCAGCTGC AGGGACTCTG CTGATTGGCC 7 20 

CAGCACCATC GTTCTGTTTG TGCTTAAATG GCACAGCATT TGGTCAGCAC ATCTGAAAAG 730 

GAAGGTGTGA GAAGCAAAGC CC ATG GCC ACG TTC CCC TGC CAG TTA TGT GGC 8 32 

Met Ala Thr Phe Pro Cys Gin Leu Cys Gly 
15 10 

AAG ACG TTC CTC ACC CTG GAG AAG TTC ACG ATT CAC AAT TAT TCC CAC 8 80 

Lys Thr Phe Leu Thr Leu Glu Lys Phe Thr lie His Asn Tyr Ser His 
15 20 f j 25 

TCC AGG GAG CGG CCG TAC AAG TGT GTG CAG CCT GAC TGT GGC AAA GCC 928 
Ser Arg Glu Arg Pro Tyr Lys Cys Val Gin Pro Asp Cys Gly Lys Ala 
30 35 40 



m TTT GTT TCC AGA TAT AAA TTG ATG AGG CAT ATG GCT ACC CAT TCT CCC 97 6 

Ci Phe Val Ser Arg Tyr Lys Leu Met Arg His Met Ala Thr His Ser Pro 

t4 45 50 55 

CAG AAA TCT CAC CAG TGT GCT CAC TGT GAG AAG ACG TTC AAC CGG AAA 1024 

JS Gin Lys Ser His Gin Cys Ala His Cys Glu Lys Thr Phe Asn Arg Lys 

S 60 65 70 

■t; GAC CAC CTG AAA AAC CAC CTC CAG ACC CAC GAC CCC AAC AAA ATG GCC 1072 

Asp His Leu Lys Asn His Leu Gin Thr His Asp Pro Asn Lys Met Ala 
€J 75 8 0 8 5 90 

TTT GGG TGT GAG GAG TGT GGG AAG AAG TAC AAC ACC ATG CTG GGC TAT 1120 

Phe Gly Cys Glu Glu Cys Gly Lys Lys Tyr Asn Thr Met Leu Gly Tyr 
95 100 105 

AAG AGG CAC CTG GCC CTC CAT GCG GCC AGC AGT GGG GAC CTC ACC TGT 1163 

Lys Arg His Leu Ala Leu His Ala Ala Ser Ser Gly Asp Leu Thr Cys 
110 115 120 

GGG GTC TGT GCC CTG GAG CTA GGG AGC ACC GAG GTG CTA CTG GAC CAC 1216 

Gly Val Cys Ala Leu Glu Leu Gly Ser Thr Glu Val Leu Leu Asp His 
125 130 135 

CTC AAA GCC CAT GCG GAA GAG AAG CCC CCT AGC GGA ACC AAG GAA AAG 12 64 

Leu Lys Ala His Ala Glu Glu Lys Pro Pro Ser Gly Thr Lys Glu Lys 
140 145 150 



AAG CAC CAG TGC GAC CAC TGT GAA AGA TGC TTC TAC ACC CGG AAG GAT 
Lys His Gin Cys Asp His Cys Glu Arg Cys Phe Tyr Thr Arg Lys Asp 
155 * 160 165 170 



1312 
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GTG CGA CGC CAC CTG GTG GTC CAC ACA GGA TGC AAG GAC TTC CTG TGC 
Val Arg Arg His Leu Val Val His Thr Gly Cys Lys Asp ?he Leu Cys 
175 180 185 

CAG TTC TGT GCC CAG AGA TTT GGG CGC AAG GAT CAC CTC ACC CGG CAT 
Gin Phe Cvs Ala Gin Arg ?he Gly Arg Lys Asp His Leu Thr Arg His 
190 195 200 

ACC AAG AAG ACC CAC TCA CAG GAG CTG ATG AAA GAG AGC TTG CAG ACC 
Thr Lys Lys Thr His Ser Gin Giu Leu Met Lys Glu Ser Leu Gin Thr 
205 210 215 

GGA GAC CTT CTG AGC ACC TTC CAC ACC ATC TCG CCT TCA TTC CAA CTG 
Gly Asp Leu Leu Ser Thr Phe His Thr lie Ser Pro Ser Phe Gin Leu 
220 225 230 

AAG GCT GCT GCC TTG CCT CCT TTC CCT TTA GGA GCT TCT GCC CAG AAC 
Lys Ala Ala Ala Leu Pro Pro Phe Pro Leu Gly Ala Ser Ala* Gin Asn 
235 240 245 250 

GGG CTT GCA AGT AGC TTG CCA GCT GAG GTC CAT AGC CTC ACC CTC AGT 
Gly Leu Ala Ser Ser Leu Pro Ala Glu Val His Ser Leu Thr Leu Ser 
255 260 265 

CCC CCA GAA CAA GCC GCC CAG CCT ATG CAG CCG CTG CCA GAG TCC CTG 
Pro Pro Glu Gin Ala Ala Gin Pro Met Gin Pro Leu Pro Glu Ser Leu 
270 275 280 

GCC TCC CTC CAC CCC TCG GTA TCC CCT GGC TCT CCT CCG CCA CCC CTT 
Ala Ser Leu His Pro Ser Val Ser Pro Gly Ser Pro Pro Pro Pro Leu 
285 290 295 

CCC AAT CAC AAG TAC AAC ACC ACT TCT ACC TCA TAC TCC CCA CTT GCA 
Pro Asn His Lys Tyr Asn Thr Thr Ser Thr Ser Tyr Ser Pro Leu Ala 
300 305 310 

AGC CTG CCC CTC AAA GCA GAT ACT AAA GGT TTT TGC AAT ATC AGT TTG 
Ser Leu Pro Leu Lys Ala Asp Thr Lys Gly Phe Cys Asn lie Ser Leu 
315 320 325 330 

TTT GAG GAC TTG CCT CTG CAA GAG CCT CAG TCA CCT CAA AAG CTC AAC 
Phe Glu Asp Leu Pro Leu Gin Glu Pro Gin Ser Pro Gin Lys Leu Asn 
335 340 345 

CCA GGT TTT GAT CTG GCT AAG GGA AAT GCT GGT AAA GTA AAC CTG CCC 
Pro Gly Phe Asp Leu Ala Lys Gly Asn Ala Gly Lys Val Asn.Leu Pro 
350 355 360 

AAG GAG CTG CCT GCA GAT GCT GTG AAC CTA ACA ATA CCT GCC TCT CTG 
Lys Glu Leu Pro Ala Asp Ala Val Asn Leu Thr lie Pro Ala Ser Leu 
365 370 375 



GAC CTG TCC CCC CTG TTG GGC TTC TGG CAG CTG CCC CCT CCT GCT ACC 
Asp Leu Ser Pro Leu Leu Gly Phe Trp Gin Leu Pro Pro Pro Ala Thr 



85 



380 335 390 



CTGGAAGATG TTTTAAGAAG CATTTTAAAT GTCAGTtACA ATATGAGAAA GATTTGGAAA 
ACGAGACTGG GACTATGGCT TATTCAGTGA TGACTGGCTT GAGATGATAA GA 

(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4 63 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY : linear 

(iij MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

Met Ala Thr Phe Pro Cys Gin Leu Cys Gly Lys Thr Phe Leu Thr Leu 
15 10 15 

Glu Lys Phe Thr He His Asn Tyr Ser His Ser Arg Glu Arg Pro Tyr 
20 25 30 

Lys Cys Val Gin Pro Asp Cys Gly Lys Ala Phe Val Ser Arg Tyr Lys 
35 40 45 

Leu Met Arg His Met Ala Thr His Ser Pro Gin Lys Ser His-,Gln Cys 

55 60 



50 



Ala His Cys Glu Lys Thr Phe Asn Arg Lys Asp His Leu Lys Asn His 
65 70 ^5 80 



2032 



2030 



2128 



CAA *AT ACC TTT GGG AAT AGC ACT CTT GCC CTG GGG CCT GGG GAA TCT 
Gin Asn Thr Phe Gly Asn Ser Thr Leu Ala Leu Gly Pro Gly Glu Ser 
395 400 405 410 

TTG CCC CAC AGG TTA AGC TGT CTG GGG CAG CAG CAG CAA GAA CCC CCA 
Leu Pro His Arg Leu Ser Cys Leu Gly Gin Gin Gin Gin Glu Pro Pro 
415 420 425 

CTT GCC ATG GGC ACT GTG AGC CTG GGC CAG CTC CCC CTG CCC CCC ATC 
Leu Ala Met Gly Thr Val Ser Leu Gly Gin Leu Pro Leu Pro Pro He 
430 435 440 

CCT CAT GTG TTC TCA GCT GGC ACT GGC TCT GCC ATC CTG CCT CAT TTC 217 6 

Pro His Val Phe Ser Ala Gly Thr Gly Ser Ala He Leu Pro His Phe 
445 450 455 

CAT CAT GCA TTC AGA T AATTGATTTT TAAAGTGTAT TTTTCGTATT { 222 2 
p His His Ala Phe Arg 
13 4 60 

/■*■• t\ rn t* t't 1 t\ 7\ a ""p t n t ■ >■ t\ r a a t a T (z A (i A A A GATTTGGAAA 2282 



2334 



Leu Gin Thr His Aso Pro Asn Lys Met Ala Phe Gly Cys Glu Glu Cys 
85 90 95 



86 



Gly Lys Lys Tyr Asn Thr Met Leu Gly Tyr Lys Arg His Leu Ala Leu 



His Ala Ala Ser Ser Gly Asp Leu 
115 



Thr Cys Gly Val Cys Ala Leu Glu 



120 125 



Leu Gly Ser Thr Glu Val Leu Leu Asp His Leu Lys Ala His Ala Glu 
130 135 140 

Glu Lvs Pro Pro Ser Gly Thr Lys Glu Lys Lys His Gin Cys Asp His 
145 " 150 155 

Cys Glu Arg Cys Phe Tyr Thr Arg Lys Asp Val Arg Arg His Leu Val 
165 l" 70 

Val His Thr Gly Cys Lys Asp Phe Leu Cys Gin Phe Cys Ala Gin Arg 
180 185 I 90 

Phe Gly Arg Lys Asp His Leu Thr Arg His Thr Lys Lys Thr, His Ser 
195 200 205 

Gin Glu Leu Met Lys Glu Ser Leu Gin Thr Gly Asp Leu Leu Ser Thr 
210 215 220 

Phe His Thr lie Ser Pro Ser Phe Gin Leu Lys Ala Ala Ala Leu Pro 
225 230 235 



Pro Leu Gly Ala Ser Ala Gin Asn Gly Leu Ala Ser Ser Leu 
245 2 50 255 

Pro Ala Glu Val His Ser Leu Thr Leu Ser Pro Pro Glu Gin Ala Ala 



Pro Phe 

245 



260 



265 210 



Gin Pro Met Gin Pro Leu Pro Glu Ser Leu Ala Ser Leu His Pro Ser 
275 



280 285 



Val S-r Gly Ser Pro Pro Pro Pro Leu Pro Asn His Lys Tyr Asn 

290 295 300 



Th- Thr Ser Thr Ser Tyr Ser Pro Leu Ala Ser Leu Pro Leu Lys Ala 
305 310 315 

Aso Thr Lys Gly Phe Cys Asn He Ser Leu Phe Glu Asp Leu Pro Leu 
325 330 335 

Gin Glu Pro Gin Ser Pro Gin Lys Leu Asn Pro Gly Phe Asp Leu Ala 



340 



34 5 350, 



Lys Gly Asn Ala Gly Lys Val Asn Leu Pro Lys Glu Leu Pro Ala Asp 



355 



360 365 



Ala Val Asn Leu Thr He Pro Ala Ser Leu Asp Leu Ser Pro Leu Leu 
370 



375 380 



87 




Gly Phe Trp Gin Leu Pro Pro Pro Ala Thr Gin Asn Thr Phe Gly Asn 



385 390 



S.r Thr Leu Ala Leu Gly Pro Gly Glu Ser Leu Pro His Arg Leu Ser 

410 H x J 



405 



C ys Leu Gly Gin Gin Gin Gin Glu Pro Pro Leu Ala Met Gly Thr Val 



420 



425 



Ser Leu Gly Gin Leu Pro Leu Pro Pro He Pro His Val Phe Ser Ala 

435 440 
Gly Thr Gly Ser Ala He Leu Pro His Phe His His Ala Phe Arg 



450 



455 



